Abstract

The efficiency of quantum state tomography is discussed from the point of view of quantum 
parameter estimation theory, in which the trace of the weighted covariance is to be minimized. 
It is shown that tomography is optimal only when a special weight is adopted. 



1 Introduction 

Let $\mathcal{L}(\mathcal{H})$ be the set of linear operators on a Hilbert space $\mathcal{H} = \mathbb{C}^2$, and let $\mathcal{S} := \{\tau_x \mid x = (x^1, x^2, x^3) \in \mathcal{X}\}$ be the set of strictly positive density operators on $\mathcal{H}$ parametrized by the Stokes parameters $x \in \mathcal{X} := \{x \in \mathbb{R}^3 \mid (x^1)^2 + (x^2)^2 + (x^3)^2 < 1\}$ as

$$\tau_x := \frac{1}{2}\left(I + x^1\sigma_1 + x^2\sigma_2 + x^3\sigma_3\right), \tag{1}$$

where $\sigma_1, \sigma_2, \sigma_3$ are the Pauli matrices. Suppose we have an unknown quantum state $\tau = \tau_x \in \mathcal{S}$.
We are interested in identifying the true value of the parameter x. 
Let

$$\mathcal{M}^{(s)}(\mathcal{H}) := \Big\{(M_1, M_2, \ldots, M_s) \;\Big|\; M_i \in \mathcal{L}(\mathcal{H}),\ M_i \ge 0,\ \sum_{i=1}^{s} M_i = I\Big\}$$

be the set of positive operator-valued measures (POVMs) on $\mathcal{H}$ taking values on a finite set of outcomes labeled by $\{1, 2, \ldots, s\}$, and let $\mathcal{M}(\mathcal{H}) := \bigcup_{s=1}^{\infty} \mathcal{M}^{(s)}(\mathcal{H})$. Given POVMs $M = (M_1, M_2, \ldots, M_{s_1})$, $N = (N_1, N_2, \ldots, N_{s_2})$, and a real number $p$ between 0 and 1, we can generate a new POVM by a randomized combination of them as follows:

$$pM \oplus (1-p)N := (pM_1, \ldots, pM_{s_1}, (1-p)N_1, \ldots, (1-p)N_{s_2}) \in \mathcal{M}(\mathcal{H}).$$

We can repeat this randomization procedure inductively to obtain $\bigoplus_{i=1}^{k} p_i M^{(i)} \in \mathcal{M}(\mathcal{H})$, where $M^{(1)}, M^{(2)}, \ldots, M^{(k)} \in \mathcal{M}(\mathcal{H})$ and $p_i > 0$ $(1 \le i \le k)$ such that $\sum_{i=1}^{k} p_i = 1$. We shall call $\bigoplus_{i=1}^{k} p_i M^{(i)}$ a random measurement when $M^{(1)}, M^{(2)}, \ldots, M^{(k)} \in \mathcal{M}(\mathcal{H})$ are all projection-valued measurements (PVMs). Applying a random measurement means applying one of the projection-valued measurements $\{M^{(i)}\}_{1 \le i \le k}$ chosen at random according to the probability distribution $p = (p_i)_{1 \le i \le k}$.

1 Helstrom [2] defined a random measurement based on a different type of convex structure of $\mathcal{M}(\mathcal{H})$ as $(pM_1 + (1-p)N_1, \ldots, pM_s + (1-p)N_s)$. Our definition of random measurement is seemingly different from his.

Let $M^{(1)}, M^{(2)}, M^{(3)}$ be projection-valued measurements given by the spectral decomposition of the observables $\sigma_1, \sigma_2, \sigma_3$, respectively, and let $M^{(T)} := \frac{1}{3}M^{(1)} \oplus \frac{1}{3}M^{(2)} \oplus \frac{1}{3}M^{(3)}$ be their random measurement according to the uniform distribution. Suppose that, among $m$ applications of $M^{(T)}$ to the unknown state $\tau_x$, the $\mu$th PVM $M^{(\mu)}$ has been chosen $m_\mu$ times and the outcomes $\pm 1$ have been observed $m_\mu^{\pm}$ times, where $m = m_1 + m_2 + m_3$ and $m_\mu = m_\mu^{+} + m_\mu^{-}$ for $\mu \in \{1, 2, 3\}$. We can construct an unbiased estimator for the Stokes parameters $x = (x^1, x^2, x^3)$ as

$$\hat{x}^\mu := \frac{m_\mu^{+} - m_\mu^{-}}{m_\mu}, \qquad \mu \in \{1, 2, 3\}. \tag{2}$$



We shall call this estimator a tomography in this paper. Note that the tomography can be regarded as a maximum likelihood estimator. In fact, since the probability distribution for the outcomes $\pm 1$ of the $\mu$th PVM $M^{(\mu)}$ applied to the state $\tau_x \in \mathcal{S}$ is given by

$$\left(\frac{1 + x^\mu}{2},\ \frac{1 - x^\mu}{2}\right), \tag{3}$$

the probability distribution for the outcome of the tomography $M^{(T)}$ is

$$\frac{1}{6}\left(1 + x^1,\ 1 - x^1,\ 1 + x^2,\ 1 - x^2,\ 1 + x^3,\ 1 - x^3\right). \tag{4}$$

As a consequence, the log-likelihood function for the outcomes $(m_\mu^{\pm})_{1 \le \mu \le 3}$ obtained by $m$ applications of $M^{(T)}$ is

$$l_m(x) = \sum_{\mu=1}^{3}\left(m_\mu^{+}\log\frac{1 + x^\mu}{6} + m_\mu^{-}\log\frac{1 - x^\mu}{6}\right),$$

and it is easy to see that $\frac{\partial}{\partial x^\mu} l_m = 0$ is equivalent to (2).
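The tomography described above can be simulated in a few lines. The following is a minimal sketch, assuming NumPy; the function name `tomography_estimate`, the seed, and the sample size are illustrative choices and not part of the paper. It draws all $m$ outcomes at once from the six-outcome distribution (4) and then evaluates the estimator (2).

```python
import numpy as np

def tomography_estimate(x, m, rng):
    """Simulate m applications of the uniform random Pauli measurement
    and return the estimator (2) for the Stokes parameters."""
    x = np.asarray(x, dtype=float)
    # Distribution (4), reordered as (+1 outcomes for axes 1..3, -1 outcomes for axes 1..3).
    probs = np.concatenate([(1.0 + x) / 6.0, (1.0 - x) / 6.0])
    counts = rng.multinomial(m, probs)
    m_plus, m_minus = counts[:3], counts[3:]
    m_mu = m_plus + m_minus
    # Estimator (2); it is undefined when an axis was never chosen, so return NaN there.
    with np.errstate(invalid="ignore", divide="ignore"):
        return np.where(m_mu > 0, (m_plus - m_minus) / m_mu, np.nan)

rng = np.random.default_rng(0)          # fixed seed, chosen only for reproducibility
x_true = np.array([0.55, 0.55, 0.55])   # the state used in the paper's simulation
x_hat = tomography_estimate(x_true, 200_000, rng)
```

For a finite number of measurements the estimate may fall outside the unit ball, so a practical implementation would need to decide how to handle that case.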

In order to investigate the optimality of the tomography, let us recall some basic facts from quantum parameter estimation theory. Let $\{\rho_\theta \mid \theta = (\theta^1, \ldots, \theta^d) \in \Theta\}$ be a smooth parametric family of density operators on a Hilbert space $\mathcal{H}$ with parameter space $\Theta \subset \mathbb{R}^d$. An estimator is represented by a pair $(M, \hat\theta)$ of a POVM $M \in \mathcal{M}(\mathcal{H})$ and a map $\hat\theta : N \to \Theta$ that gives the estimated value $\hat\theta(n)$ from each observed data $n \in N$. An estimator $(M, \hat\theta)$ is called unbiased if

$$E_\theta[M, \hat\theta] := \sum_{n \in N} \hat\theta(n)\, \operatorname{Tr} \rho_\theta M_n = \theta \tag{5}$$

is satisfied for all $\theta \in \Theta$. An estimator $(M, \hat\theta)$ is called locally unbiased at a given point $\theta_0 \in \Theta$ if the condition (5) is satisfied around $\theta = \theta_0$ up to the first order of the Taylor expansion. It is well known that an estimator $(M, \hat\theta)$ that is locally unbiased at $\theta_0$ satisfies the following series of inequalities [1, 2]:

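$$V_{\theta_0}[M, \hat\theta] \ge \left(g_{\theta_0}(M)\right)^{-1} \ge \left(J_{\theta_0}\right)^{-1}, \tag{6}$$

where $V_\theta[\cdot]$ denotes the covariance matrix, and $g_\theta(M)$ is the classical Fisher information matrix at $\theta$ with respect to $M \in \mathcal{M}(\mathcal{H})$ defined by

$$g_\theta(M) := \left[\sum_{n \in N} \frac{\left(\partial_i \operatorname{Tr} \rho_\theta M_n\right)\left(\partial_j \operatorname{Tr} \rho_\theta M_n\right)}{\operatorname{Tr} \rho_\theta M_n}\right]_{1 \le i, j \le d}, \qquad \partial_i := \frac{\partial}{\partial\theta^i}.$$

Further, $J_\theta$ is the quantum Fisher information matrix at $\theta$ given by

$$J_\theta := \left[\frac{1}{2}\operatorname{Tr} \rho_\theta\left(L_i L_j + L_j L_i\right)\right]_{1 \le i, j \le d},$$

where $L_i$ is the $i$th symmetric logarithmic derivative (SLD) defined by the selfadjoint operator satisfying the equation

$$\frac{\partial}{\partial\theta^i}\rho_\theta = \frac{1}{2}\left(L_i \rho_\theta + \rho_\theta L_i\right). \tag{7}$$

2 There is a possibility that $\hat{x} \notin \mathcal{X}$. However, it follows from the law of large numbers that the tomography satisfies $\hat{x} \in \mathcal{X}$ for sufficiently large $m$ almost surely.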

The inequality $V_\theta[M, \hat\theta] \ge (J_\theta)^{-1}$ is called the quantum Cramér-Rao inequality. The first inequality in (6) is saturated when $\hat\theta^i(n) = \theta^i + \sum_{j=1}^{d}\left((g_\theta(M))^{-1}\right)^{ij} \partial_j\left(\log \operatorname{Tr} \rho_\theta M_n\right)$ is adopted. However, the second inequality in (6) cannot be saturated in general because of the non-commutativity of the SLDs. To avoid this difficulty, we often adopt an alternative strategy to seek the estimator which minimizes $\operatorname{Tr} H_\theta V_\theta[M, \hat\theta]$, where $H_\theta$ is a given $d \times d$ real positive definite matrix for each $\theta$ called a weight [1, 2]. Thus the problem of finding the optimal estimator boils down to the problem of finding $M \in \mathcal{M}(\mathcal{H})$ which minimizes $\operatorname{Tr} H_\theta\, g_\theta(M)^{-1}$.

It is known that when $\dim\mathcal{H} = 2$, there is a definitive answer to the optimality of estimators, which is summarized in the following Propositions.

Proposition 1. For a given weight $H_\theta$,

$$\min\left\{\operatorname{Tr} H_\theta\, g_\theta(M)^{-1} \mid M \in \mathcal{M}(\mathcal{H})\right\} = \left(\operatorname{Tr} R_\theta\right)^2, \tag{8}$$

where $R_\theta := \sqrt{\sqrt{J_\theta^{-1}}\, H_\theta\, \sqrt{J_\theta^{-1}}}$. The minimum is attained if and only if $M \in \mathcal{M}(\mathcal{H})$ satisfies

$$g_\theta(M) = \sqrt{J_\theta}\, \frac{R_\theta}{\operatorname{Tr} R_\theta}\, \sqrt{J_\theta}. \tag{9}$$

Proposition 1 was first proved by Nagaoka [3] (cf. [1]) when $d = 2$. The case $d = 3$ was proved by Hayashi [5], and independently by Gill and Massar [6]. Further, Nagaoka constructed explicitly a measurement which attains the minimum when $d = 2$. His construction of an optimal estimator can be generalized as follows.

Proposition 2. Given a weight $H_\theta$, let us diagonalize $R_\theta$ as $R_\theta = U S U^{-1}$, where $S = \operatorname{diag}(s_1, \ldots, s_d)$ is a diagonal matrix and $U \in O(d)$, and let $M^{(i)}$ be a projection-valued measurement given by the spectral decomposition of the operator

$$L^i := \sum_{k=1}^{d} K^{ik} L_k, \tag{10}$$

where $K^{ik} := \left(U^{-1}\sqrt{J_\theta^{-1}}\right)^{ik}$. Then the random measurement

$$M := p_1 M^{(1)} \oplus \cdots \oplus p_d M^{(d)} \tag{11}$$

satisfies (9), where $p_i := s_i/(s_1 + \cdots + s_d)$.

Note that the optimal measurement (11) depends on the true value of $\theta \in \Theta$ in general. In such a case, we necessarily invoke an adaptive estimation scheme [5] to achieve the minimum.
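The quantities appearing in Propositions 1 and 2 are straightforward to evaluate numerically. The following is a minimal sketch, assuming NumPy; the helper names `sqrtm_psd` and `optimal_bound` are hypothetical, and the qubit example borrows the closed form $J_x = (I - |x\rangle\langle x|)^{-1}$ that the paper derives later in Lemma 5. It computes the bound (8), the mixing probabilities $p_i$ of (11), and the Fisher information (9) that an optimal measurement must have, and then checks that (9) reproduces (8).

```python
import numpy as np

def sqrtm_psd(a):
    """Square root of a real symmetric positive semidefinite matrix."""
    w, v = np.linalg.eigh(a)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.T

def optimal_bound(J, H):
    """Return (Tr R)^2 from (8), the probabilities p_i of (11),
    and the target Fisher information matrix (9)."""
    J_sqrt = sqrtm_psd(J)
    J_inv_sqrt = np.linalg.inv(J_sqrt)
    R = sqrtm_psd(J_inv_sqrt @ H @ J_inv_sqrt)
    s, _ = np.linalg.eigh(R)               # eigenvalues s_i of R
    p = s / s.sum()                        # p_i = s_i / (s_1 + ... + s_d)
    g_opt = J_sqrt @ R @ J_sqrt / np.trace(R)
    return np.trace(R) ** 2, p, g_opt

# Illustrative qubit example with weight H = I (values chosen arbitrarily).
x = np.array([0.3, 0.2, 0.1])
J = np.linalg.inv(np.eye(3) - np.outer(x, x))
H = np.eye(3)
bound, p, g_opt = optimal_bound(J, H)
attained = np.trace(H @ np.linalg.inv(g_opt))   # Tr H g^{-1} evaluated at (9)
```

The sketch only evaluates the bound; constructing the PVMs $M^{(i)}$ themselves would additionally require the SLDs and the spectral decomposition of (10).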

Now it is natural to inquire whether the tomography is optimal in view of Propositions 1 and 2. The answer is given by the following.

Theorem 3. Tomography is optimal if and only if the weight $H_x$ is proportional to the following special one:

$$H_x^{(T)} := \begin{pmatrix}
\dfrac{1}{1-(x^1)^2} & -\dfrac{x^1 x^2}{(1-(x^1)^2)(1-(x^2)^2)} & -\dfrac{x^3 x^1}{(1-(x^3)^2)(1-(x^1)^2)} \\[2ex]
-\dfrac{x^1 x^2}{(1-(x^1)^2)(1-(x^2)^2)} & \dfrac{1}{1-(x^2)^2} & -\dfrac{x^2 x^3}{(1-(x^2)^2)(1-(x^3)^2)} \\[2ex]
-\dfrac{x^3 x^1}{(1-(x^3)^2)(1-(x^1)^2)} & -\dfrac{x^2 x^3}{(1-(x^2)^2)(1-(x^3)^2)} & \dfrac{1}{1-(x^3)^2}
\end{pmatrix}. \tag{12}$$

Note that $H_x^{(T)}$ is not rotationally symmetric. This implies that the optimal weight depends on the choice of the coordinate axes. Theorem 3 also implies that the tomography is not optimal for a rotationally symmetric weight, which is the natural choice from a physical point of view.

The paper is organized as follows. Theorem 3 is proved in Section 2, and the non-optimality of the tomography for a rotationally symmetric weight is discussed and numerically demonstrated in Section 3. An extension to the case when $\dim\mathcal{H} \ge 3$ is also discussed there. For the reader's convenience, simple proofs of Propositions 1 and 2 are given in Appendix A.

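2 Proof of Theorem 3

We prove Theorem 3 in a series of Lemmas.

Lemma 4. Let $L_\mu$ be the SLD of $\tau = \tau_x$ with respect to $x^\mu$ for $\mu \in \{1, 2, 3\}$. Then

$$L_\mu = \sigma_\mu - \frac{x^\mu}{2\det\tau}(I - \tau).$$

Proof. We need only verify that $L_\mu$ satisfies equation (7). Since $\tau(I - \tau) = (\det\tau)\, I$ for a density operator on $\mathbb{C}^2$,

$$L_\mu \tau = \sigma_\mu \tau - \frac{x^\mu}{2\det\tau}\,\tau(I - \tau) = \sigma_\mu \tau - \frac{x^\mu}{2} I.$$

Therefore

$$\frac{1}{2}\left(L_\mu \tau + \tau L_\mu\right) = \frac{1}{2}\left(\{\tau, \sigma_\mu\} - x^\mu I\right) = \frac{1}{2}\left(\Big\{\frac{I}{2}, \sigma_\mu\Big\} + \Big\{\sum_{\nu} \frac{x^\nu \sigma_\nu}{2}, \sigma_\mu\Big\} - x^\mu I\right) = \frac{1}{2}\sigma_\mu = \frac{\partial \tau_x}{\partial x^\mu},$$

where $\{A, B\} := AB + BA$ for $A, B \in \mathcal{L}(\mathcal{H})$. □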
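Lemma 5. Let $J_x$ be the SLD Fisher information matrix at $x$. Then

$$J_x = \left(I - |x\rangle\langle x|\right)^{-1},$$

where $|x\rangle = (x^1, x^2, x^3)^t$ is regarded as a column vector and $\langle x|$ is its transpose.

Proof. We calculate the elements of $J_x$. By (7) and Lemma 4,

$$(J_x)_{\mu\nu} = \frac{1}{2}\operatorname{Tr}\tau\left(L_\mu L_\nu + L_\nu L_\mu\right) = \operatorname{Tr}\frac{\partial\tau}{\partial x^\mu} L_\nu = \frac{1}{2}\operatorname{Tr}\sigma_\mu\left(\sigma_\nu - \frac{x^\nu}{2\det\tau}(I - \tau)\right) = \delta_{\mu\nu} + \frac{x^\mu x^\nu}{4\det\tau}.$$

Thus, since $\det\tau = (1 - r^2)/4$,

$$J_x = I + \frac{1}{4\det\tau}|x\rangle\langle x| = I + \frac{1}{1 - r^2}|x\rangle\langle x|,$$

where $r = \sqrt{\langle x|x\rangle}$. Then

$$\left(I - |x\rangle\langle x|\right)\left(I + \frac{1}{1 - r^2}|x\rangle\langle x|\right) = I + \frac{1}{1 - r^2}|x\rangle\langle x| - |x\rangle\langle x| - \frac{r^2}{1 - r^2}|x\rangle\langle x| = I.$$

Therefore $I + \frac{1}{1 - r^2}|x\rangle\langle x| = \left(I - |x\rangle\langle x|\right)^{-1}$. □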
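Lemmas 4 and 5 can be checked numerically at any interior point of the Bloch ball. The following is a minimal sketch, assuming NumPy; the helper names `qubit_state` and `sld` and the test point are illustrative. It verifies that the operator of Lemma 4 solves the SLD equation (7) and that the resulting SLD Fisher information matrix matches the closed form of Lemma 5.

```python
import numpy as np

PAULI = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)
I2 = np.eye(2, dtype=complex)

def qubit_state(x):
    """Density operator (1) for Stokes parameters x."""
    return 0.5 * (I2 + np.einsum("m,mij->ij", x, PAULI))

def sld(x, mu):
    """SLD of Lemma 4 with respect to the mu-th Stokes parameter."""
    tau = qubit_state(x)
    det_tau = (1.0 - x @ x) / 4.0
    return PAULI[mu] - x[mu] / (2.0 * det_tau) * (I2 - tau)

x = np.array([0.3, -0.4, 0.5])   # arbitrary point with |x| < 1
tau = qubit_state(x)
slds = [sld(x, mu) for mu in range(3)]

# Residual of equation (7): (L tau + tau L)/2 should equal d tau / d x^mu = sigma_mu / 2.
sld_residual = max(
    np.abs(0.5 * (L @ tau + tau @ L) - 0.5 * PAULI[mu]).max()
    for mu, L in enumerate(slds)
)

# SLD Fisher information from its definition, and the closed form of Lemma 5.
J_numeric = np.array(
    [[0.5 * np.trace(tau @ (A @ B + B @ A)).real for B in slds] for A in slds]
)
J_closed = np.linalg.inv(np.eye(3) - np.outer(x, x))
```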
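Lemma 6. Let $F_x \in g_x(\mathcal{M}(\mathcal{H}))$ with $F_x > 0$ be given. There exists a weight $H_x$ such that

$$\min_{M \in \mathcal{M}(\mathcal{H})}\left\{\operatorname{Tr} H_x\, g_x(M)^{-1}\right\} = \operatorname{Tr} H_x F_x^{-1} \tag{13}$$

if and only if

$$\operatorname{Tr} J_x^{-1} F_x = 1. \tag{14}$$

Further, when (14) is satisfied,

$$H_x = k\, F_x J_x^{-1} F_x \tag{15}$$

is the only weight which satisfies (13), where $k$ is an arbitrary positive real number.

Proof. We first assume that there exists a weight $H_x$ which satisfies (13). Let $R_x := \sqrt{\sqrt{J_x^{-1}}\, H_x\, \sqrt{J_x^{-1}}}$. According to Proposition 1, $F_x$ must be

$$F_x = \sqrt{J_x}\, \frac{R_x}{\operatorname{Tr} R_x}\, \sqrt{J_x},$$

so that

$$\operatorname{Tr} J_x^{-1} F_x = \frac{\operatorname{Tr} J_x^{-1}\sqrt{J_x}\, R_x \sqrt{J_x}}{\operatorname{Tr} R_x} = 1.$$

Then we conclude (14).

We next assume that (14) is satisfied. Let $H_x = k\, F_x J_x^{-1} F_x$. It follows from Proposition 1 that

$$\min_{M \in \mathcal{M}(\mathcal{H})} \operatorname{Tr} H_x\, g_x(M)^{-1} = \left(\operatorname{Tr}\sqrt{k\sqrt{J_x^{-1}}\, F_x J_x^{-1} F_x \sqrt{J_x^{-1}}}\right)^2 = k\left(\operatorname{Tr} J_x^{-1} F_x\right)^2 = k\left(\operatorname{Tr} J_x^{-1} F_x\right) = \operatorname{Tr}\left(k F_x J_x^{-1} F_x\right) F_x^{-1} = \operatorname{Tr} H_x F_x^{-1}.$$

Further, the weights of the form (15) are the only weights which satisfy (13) because the mapping

$$\mathcal{M}^{(1)}(d, \mathbb{R}) \ni H \longmapsto \sqrt{J_x}\, \frac{\sqrt{\sqrt{J_x^{-1}}\, H \sqrt{J_x^{-1}}}}{\operatorname{Tr}\sqrt{\sqrt{J_x^{-1}}\, H \sqrt{J_x^{-1}}}}\, \sqrt{J_x} \in g_x(\mathcal{M}(\mathcal{H}))$$

is injective, where $\mathcal{M}^{(1)}(d, \mathbb{R}) := \{G \mid G \text{ is a } d \times d \text{ real positive definite matrix},\ \operatorname{Tr} G = 1\}$. □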



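Proof of Theorem 3. We can calculate the classical Fisher information matrix with respect to $M^{(T)}$ from (4) as follows:

$$g_x(M^{(T)}) = \frac{1}{3}\begin{pmatrix}
\dfrac{1}{1-(x^1)^2} & 0 & 0 \\
0 & \dfrac{1}{1-(x^2)^2} & 0 \\
0 & 0 & \dfrac{1}{1-(x^3)^2}
\end{pmatrix}. \tag{16}$$

Then

$$\operatorname{Tr} J_x^{-1} g_x(M^{(T)}) = \operatorname{Tr}\left(I - |x\rangle\langle x|\right) g_x(M^{(T)}) = \frac{1}{3}\left(\frac{1}{1-(x^1)^2} + \frac{1}{1-(x^2)^2} + \frac{1}{1-(x^3)^2} - \frac{(x^1)^2}{1-(x^1)^2} - \frac{(x^2)^2}{1-(x^2)^2} - \frac{(x^3)^2}{1-(x^3)^2}\right) = 1.$$

We see from Lemma 6 that $H_x := k\, g_x(M^{(T)})\, J_x^{-1}\, g_x(M^{(T)})$ are the only weights which satisfy

$$\min_{M \in \mathcal{M}(\mathcal{H})}\left\{\operatorname{Tr} H_x\, g_x(M)^{-1}\right\} = \operatorname{Tr} H_x\, g_x(M^{(T)})^{-1}.$$

Then

$$k\, g_x(M^{(T)})\, J_x^{-1}\, g_x(M^{(T)}) = k\, g_x(M^{(T)})\left(I - |x\rangle\langle x|\right) g_x(M^{(T)}) = k\left(g_x(M^{(T)})^2 - g_x(M^{(T)})|x\rangle\langle x|\, g_x(M^{(T)})\right)$$

$$= \frac{k}{9}\begin{pmatrix}
\dfrac{1}{1-(x^1)^2} & -\dfrac{x^1 x^2}{(1-(x^1)^2)(1-(x^2)^2)} & -\dfrac{x^3 x^1}{(1-(x^3)^2)(1-(x^1)^2)} \\[2ex]
-\dfrac{x^1 x^2}{(1-(x^1)^2)(1-(x^2)^2)} & \dfrac{1}{1-(x^2)^2} & -\dfrac{x^2 x^3}{(1-(x^2)^2)(1-(x^3)^2)} \\[2ex]
-\dfrac{x^3 x^1}{(1-(x^3)^2)(1-(x^1)^2)} & -\dfrac{x^2 x^3}{(1-(x^2)^2)(1-(x^3)^2)} & \dfrac{1}{1-(x^3)^2}
\end{pmatrix} = \frac{k}{9} H_x^{(T)}.$$

□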
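The two computations in this proof can be reproduced numerically. The following is a minimal sketch, assuming NumPy; the test point `x` and the constant `k` are arbitrary illustrative values. It checks that $g_x(M^{(T)})$ satisfies the trace condition (14) and that $k\, g_x(M^{(T)}) J_x^{-1} g_x(M^{(T)})$ equals $(k/9) H_x^{(T)}$ with $H_x^{(T)}$ as in (12).

```python
import numpy as np

x = np.array([0.55, 0.3, -0.2])      # arbitrary point with |x| < 1
d = 1.0 - x**2                       # the quantities 1 - (x^mu)^2

# Classical Fisher information (16) of the tomography.
g_T = np.diag(1.0 / (3.0 * d))

# Inverse SLD Fisher information from Lemma 5.
J_inv = np.eye(3) - np.outer(x, x)

# Condition (14): Tr J^{-1} g(M^(T)) should equal 1.
trace_condition = np.trace(J_inv @ g_T)

# Weight (12): diagonal entries 1/d_mu, off-diagonal entries -x^mu x^nu / (d_mu d_nu).
H_T = -np.outer(x / d, x / d)
np.fill_diagonal(H_T, 1.0 / d)

k = 2.5                              # arbitrary positive constant
lhs = k * g_T @ J_inv @ g_T
rhs = (k / 9.0) * H_T
```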



3 Discussions 

Let us investigate the properties of the weight $H_x^{(T)}$ that is optimal for the tomography. We first regard a weight $H_x$ as a metric tensor on the tangent space $T_{\tau_x}\mathcal{S}$ at $x \in \mathcal{X}$, and let us plot the indicatrix, the set of end points of tangent vectors $v \in T_{\tau_x}\mathcal{S}$ centered at $x$ satisfying ${}^t v H_x v = 1$. Figure 1 shows the indicatrices on the $x^1 x^2$-plane for $H_x = H_x^{(T)}$ (left), $H_x = J_x$ (middle), and $H_x = I$ (right). Obviously $H_x^{(T)}$ is not rotationally symmetric, and is awkwardly distorted when $x = (x^1, x^2, x^3) \in \mathcal{X}$ is off the coordinate axes. This means that the tomography depends highly on the choice of the coordinate axes. However, an estimation scheme should be independent of the choice of the coordinate axes because their choice is completely arbitrary. It is therefore natural to adopt a rotationally symmetric weight $H_x$ which satisfies $U^* H_{(Ux)} U = H_x$ for all $U \in SO(3)$. Any rotationally symmetric weight can be represented by

$$H_x^{(f,g)} := f(r)\, I + \left(g(r) - f(r)\right)\frac{1}{r^2}|x\rangle\langle x|, \tag{17}$$

for $x \ne 0$, where $f, g$ are functions on $(0, 1)$ such that $f(r) > 0$ and $g(r) > 0$ (see Appendix B). Given a weight $H_x = H_x^{(f,g)}$, let $M^{(f,g)} \in \mathcal{M}(\mathcal{H})$ be the corresponding optimal measurement given by (11), and let $c_x := \operatorname{Tr} H_x^{(f,g)} g_x(M^{(f,g)})^{-1}$ and $c_x^{(T)} := \operatorname{Tr} H_x^{(f,g)} g_x(M^{(T)})^{-1}$. It then follows from (8) and (16) that

$$c_x = \left(\operatorname{Tr}\sqrt{\sqrt{J_x^{-1}}\, H_x^{(f,g)} \sqrt{J_x^{-1}}}\right)^2 = \left(2\sqrt{f(r)} + \sqrt{(1 - r^2)\, g(r)}\right)^2 \tag{18}$$

and

$$c_x^{(T)} = 3\left(2f(r) + (1 - r^2)\, g(r)\right) + 3 t r^2\left(g(r) - f(r)\right), \tag{19}$$

where $t := 1 - \dfrac{(x^1)^4 + (x^2)^4 + (x^3)^4}{r^4}$. Note that $0 \le t \le \frac{2}{3}$, and that $t = 0$ if and only if $x$ is on one of the coordinate axes, and $t = \frac{2}{3}$ if and only if $x$ is parallel to one of the vectors $(1, 1, 1)$, $(-1, 1, 1)$, $(1, -1, 1)$, and $(1, 1, -1)$. In addition,

$$c_x^{(T)} - c_x = 2\left(\sqrt{(1 - r^2)\, g(r)} - \sqrt{f(r)}\right)^2 + 3r^2\left(g(r) - f(r)\right) t \tag{20}$$

$$= 2\left(\sqrt{(1 - r^2)\, f(r)} - \sqrt{g(r)}\right)^2 + 3r^2\left(f(r) - g(r)\right)\left(\frac{2}{3} - t\right). \tag{21}$$

By considering the cases $g(r) \ge f(r)$ and $f(r) > g(r)$ separately, we conclude that $c_x^{(T)} \ge c_x$ for any rotationally symmetric weight $H_x^{(f,g)}$.

Figure 2: The behavior of $c_{rv}$ (solid) and $c_{rv}^{(T)}$ (dashed) as functions of radius $r$ in the direction $v = \frac{1}{\sqrt{3}}(1, 1, 1)^t$ for $H_x^{(f,g)} = J_x$ (left) and $H_x^{(f,g)} = I$ (right).
For example, when $H_x^{(f,g)} = J_x$, for which $f(r) = 1$ and $g(r) = \frac{1}{1 - r^2}$, we see that $g(r) - f(r) \to \infty$ as $r \to 1$, so that $c_x^{(T)}$ becomes much larger than $c_x$. On the other hand, when $H_x^{(f,g)} = I$, for which $f(r) = g(r) = 1$, the second terms in (20) and (21) vanish, and the difference $c_x^{(T)} - c_x$ becomes relatively small. Figure 2 shows the behavior of $c_{rv}$ (solid) and $c_{rv}^{(T)}$ (dashed) as functions of radius $r$ in the direction $v = \frac{1}{\sqrt{3}}(1, 1, 1)^t$ for $H_x^{(f,g)} = J_x$ (left) and $H_x^{(f,g)} = I$ (right). When $H_x^{(f,g)} = J_x$, we see that $c_{rv}^{(T)}$ diverges as $r \to 1$, while $c_{rv}$ converges to 9. When $H_x^{(f,g)} = I$, on the other hand, $c_{rv}^{(T)}$ and $c_{rv}$ converge to 6 and 4 respectively as $r \to 1$, and their difference is relatively small.
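The limiting values quoted above follow directly from (18) and (19). The following is a minimal sketch, assuming NumPy; the function name `costs` and the radius `r = 0.999` are illustrative choices. It evaluates $c_x$ and $c_x^{(T)}$ along the direction $(1, 1, 1)/\sqrt{3}$ close to the boundary of the Bloch ball for the two weights discussed in the text.

```python
import numpy as np

def costs(x, f, g):
    """Return (c_x, c_x^(T)) from (18) and (19) for the weight H^(f,g)."""
    x = np.asarray(x, dtype=float)
    r = np.linalg.norm(x)
    fr, gr = f(r), g(r)
    t = 1.0 - np.sum(x**4) / r**4
    c_opt = (2.0 * np.sqrt(fr) + np.sqrt((1.0 - r**2) * gr)) ** 2          # (18)
    c_tom = 3.0 * (2.0 * fr + (1.0 - r**2) * gr) + 3.0 * t * r**2 * (gr - fr)  # (19)
    return c_opt, c_tom

v = np.ones(3) / np.sqrt(3.0)
r = 0.999   # close to the boundary r -> 1

# Weight H = J_x: f(r) = 1, g(r) = 1 / (1 - r^2).
c_J = costs(r * v, lambda r: 1.0, lambda r: 1.0 / (1.0 - r**2))

# Weight H = I: f(r) = g(r) = 1.
c_I = costs(r * v, lambda r: 1.0, lambda r: 1.0)
```

With these inputs `c_J[0]` stays at 9 while `c_J[1]` grows without bound as `r` approaches 1, and `c_I` approaches the pair (4, 6), in line with the discussion above.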

Now let us make a numerical simulation to compare the asymptotic performance of the tomography and the optimal adaptive estimation schemes for $H_x = J_x$ and $H_x = I$. We set the qubit state to be estimated as $\tau_{x_0}$ with $x_0 = (0.55, 0.55, 0.55)$. Since the optimal estimator given in Proposition 2 depends on the true value of $x \in \mathcal{X}$, we shall invoke an adaptive estimation scheme in evaluating $\operatorname{Tr} H_x\, g_x(M(x))^{-1}$, with $M(x)$ being the optimal POVM for $H_x$ at $x$, as follows [HE]:

We begin by choosing $\hat{x}^{(0)} \in \mathcal{X}$ arbitrarily. Suppose that $M(\hat{x}^{(0)})$ is applied and that the outcome $n_1 \in \{1, 2, \ldots, s\}$ is obtained. The maximum likelihood estimator is given by

$$\hat{x}^{(1)} := \operatorname*{argmax}_{x \in \mathcal{X}}\ l_1(x),$$

where

$$l_1(x) := \log\operatorname{Tr}\tau_x M_{n_1}(\hat{x}^{(0)}).$$

At the $m$th stage $(m \ge 2)$, suppose that $M(\hat{x}^{(m-1)})$ is applied and that the outcome $n_m \in \{1, 2, \ldots, s\}$ is obtained. The maximum likelihood estimator at the $m$th stage is given by

$$\hat{x}^{(m)} := \operatorname*{argmax}_{x \in \mathcal{X}}\ l_m(x),$$

where

$$l_m(x) := \sum_{i=1}^{m}\log\operatorname{Tr}\tau_x M_{n_i}(\hat{x}^{(i-1)}).$$

Figure 3: A numerical comparison between the tomography and the optimal adaptive estimation for the weight $H_x$, where $H_x$ has been set as $H_x = J_x$ (left) or $H_x = I$ (right). The solid and dashed curves correspond to the adaptive estimation and the tomography, respectively, and the solid and dashed horizontal lines correspond to the theoretical limits. As a figure of merit, we have plotted the sample averages of $2m \times B(\tau_{x_0}, \tau_{\hat{x}^{(m)}})$ or $m \times |x_0 - \hat{x}^{(m)}|^2$ instead of $m \times \operatorname{Tr} H_{x_0} V[\hat{x}^{(m)}]$.

Because of the strong consistency and the asymptotic efficiency of the adaptive estimation [5], the sequence $m \times \operatorname{Tr} H_{x_0} V[\hat{x}^{(m)}]$ of the weighted covariances multiplied by $m$ converges to $\operatorname{Tr} H_{x_0}\, g_{x_0}(M(x_0))^{-1}$ as $m \to \infty$. Let us demonstrate this behavior by a numerical simulation. We have performed two kinds of numerical simulations in which the weight $H_x$ has been set as $H_x = J_x$ and $H_x = I$. These results are shown in the left and the right panels of Figure 3, where the solid and dashed curves correspond to the adaptive estimation and the tomography, and the solid and dashed horizontal lines correspond to the theoretical limits. As figures of merit, we have plotted in Figure 3 the sample averages of $2m \times B(\tau_{x_0}, \tau_{\hat{x}^{(m)}})$, where $B(\cdot, \cdot)$ is the Bures distance, or $m \times |x_0 - \hat{x}^{(m)}|^2$ instead of $m \times \operatorname{Tr} J_{x_0} V[\hat{x}^{(m)}]$ or $m \times \operatorname{Tr} V[\hat{x}^{(m)}]$, because they are asymptotically equivalent (see Appendix C). The sample averages are calculated by repeating the estimation schemes 1000 times. We see that the sample average of each estimation scheme approaches the corresponding theoretical value as $m$ becomes large. We further observe that the adaptive estimation scheme is more efficient than the tomography, and the difference in their performance is noticeable when $H_x = J_x$. We conclude that the tomography is not efficient for a rotationally symmetric weight, which is the natural choice in estimating an unknown qubit state.

Figure 4: The behavior of $c_{rv}$ (solid) and $c_{rv}^{(T)}$ (dashed) as functions of $r$ in the direction $v \in \mathbb{R}^{q^2 - 1}$, where $v_{1,1} = 1$ and $v_{\alpha,i} = 0$ ($\alpha \ne 1$ or $i \ne 1$), for $\dim\mathcal{H} = 3$ (left) and $\dim\mathcal{H} = 4$ (right).

Finally we shall touch upon a generalization to a higher dimensional Hilbert space $\mathcal{H}$. Let $q = \dim\mathcal{H}$ $(\ge 3)$ and let $\{|e_i^{(\alpha)}\rangle\}_{i=1}^{q}$ be an orthonormal basis for each $\alpha = 1, \ldots, q + 1$ satisfying $|\langle e_i^{(\alpha)} | e_j^{(\beta)}\rangle|^2 = \frac{1}{q}$ $(\alpha \ne \beta)$ for all $i, j$. Such a finite subset $\{|e_i^{(\alpha)}\rangle\}_{\alpha, i}$ of the Hilbert space $\mathcal{H}$ is called a full set of mutually unbiased bases. It is known that a full set of mutually unbiased bases exists when $q$ is a prime number or a power of a prime [9]. As before, we regard the uniform combination

$$M^{(T)} := \frac{1}{q + 1}\bigoplus_{\alpha=1}^{q+1} M^{(\alpha)}$$

of the PVMs $M^{(\alpha)} := \left(|e_1^{(\alpha)}\rangle\langle e_1^{(\alpha)}|, \ldots, |e_q^{(\alpha)}\rangle\langle e_q^{(\alpha)}|\right) \in \mathcal{M}(\mathcal{H})$ as a tomography on $\mathcal{H}$. Let $\mathcal{S}$ be the set of strictly positive density operators on $\mathcal{H}$ and let $x = \{x_{\alpha,i}\}$ be an affine parametrization of $\mathcal{S}$ given by

^ a=l 4=1 * 

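Figure 4 shows the behavior of $c_{rv}$ (solid) and $c_{rv}^{(T)}$ (dashed) as functions of $r$ in the direction $v \in \mathbb{R}^{q^2 - 1}$, where

$$c_x := \min\left\{\operatorname{Tr} H_x\, g_x(M)^{-1} \mid M \in \mathcal{M}(\mathcal{H})\right\}, \qquad c_x^{(T)} := \operatorname{Tr} H_x\, g_x(M^{(T)})^{-1},$$

with $v_{1,1} = 1$ and $v_{\alpha,i} = 0$ ($\alpha \ne 1$ or $i \ne 1$) for $\dim\mathcal{H} = 3$ (left) and $\dim\mathcal{H} = 4$ (right). We see that the behavior for $\dim\mathcal{H} = 3$ and $4$ is almost the same as that for $\dim\mathcal{H} = 2$ plotted in Figure 2. This observation suggests that the same non-optimality result would hold for $\dim\mathcal{H} \ge 3$.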
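Appendices

A Proofs of Propositions 1 and 2

In this appendix, we give simple proofs of Propositions 1 and 2 for the reader's convenience. We start with some lemmas which hold for an arbitrary finite dimensional Hilbert space $\mathcal{H}$. Let us define the inner product $\langle\cdot, \cdot\rangle_\theta$ on $\mathcal{L}(\mathcal{H})$ as

$$\langle A, B\rangle_\theta := \frac{1}{2}\operatorname{Tr}\rho_\theta\left(A^* B + B A^*\right).$$

Then we can rewrite $g_\theta(M)$ by SLD as follows:

$$g_\theta(M) = \left[\sum_x \frac{\langle L_i, M_x\rangle_\theta \langle L_j, M_x\rangle_\theta}{\langle I, M_x\rangle_\theta}\right]_{1 \le i, j \le d}.$$

Further, we can also rewrite $J_\theta$ as $J_\theta = \left[\langle L_i, L_j\rangle_\theta\right]_{ij}$. Let us define $L^i$ as in (10), and let us define

$$\tilde{g}_\theta(M) := \left[\sum_x \frac{\langle L^i, M_x\rangle_\theta \langle L^j, M_x\rangle_\theta}{\langle I, M_x\rangle_\theta}\right]_{1 \le i, j \le d}.$$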

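Lemma 7. $\{L^i\}_{i=1}^{d} \cup \{I\}$ is orthonormal with respect to $\langle\cdot, \cdot\rangle_\theta$.

Proof.

$$\langle L^i, L^j\rangle_\theta = \sum_{s,t} K^{is} K^{jt} \langle L_s, L_t\rangle_\theta = \left(U^{-1}\sqrt{J_\theta^{-1}}\, J_\theta \sqrt{J_\theta^{-1}}\, U\right)^{ij} = \delta^{ij}.$$

Further,

$$\langle L^i, I\rangle_\theta = \sum_s K^{is}\langle L_s, I\rangle_\theta = \sum_s K^{is}\operatorname{Tr}\rho_\theta L_s = \sum_s K^{is}\operatorname{Tr}\partial_s\rho_\theta = 0,$$

and

$$\langle I, I\rangle_\theta = \operatorname{Tr}\rho_\theta = 1.$$

□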



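Lemma 8. It holds that

$$\operatorname{Tr}\tilde{g}_\theta(M) \le \dim\mathcal{H} - 1$$

for all $M \in \mathcal{M}(\mathcal{H})$.

Proof.

$$\operatorname{Tr}\tilde{g}_\theta(M) = \sum_x\sum_{i=1}^{d}\frac{\langle L^i, M_x\rangle_\theta^2}{\langle I, M_x\rangle_\theta} = \sum_x\left(\frac{\sum_{i=1}^{d}\langle L^i, M_x\rangle_\theta^2 + \langle I, M_x\rangle_\theta^2}{\langle I, M_x\rangle_\theta} - \langle I, M_x\rangle_\theta\right)$$

$$\le \sum_x\frac{\langle M_x, M_x\rangle_\theta}{\langle I, M_x\rangle_\theta} - 1 \tag{22}$$

$$\le \sum_x\operatorname{Tr} M_x - 1 \tag{23}$$

$$= \operatorname{Tr} I - 1 = \dim\mathcal{H} - 1.$$

Inequality (22) follows from Lemma 7 and Bessel's inequality, together with $\sum_x \langle I, M_x\rangle_\theta = 1$, and inequality (23) from

$$\langle I, M_x\rangle_\theta\operatorname{Tr} M_x = \left(\operatorname{Tr}\rho_\theta M_x\right)\left(\operatorname{Tr} M_x\right) \ge \operatorname{Tr}\rho_\theta M_x^2 = \langle M_x, M_x\rangle_\theta.$$

□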



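Lemma 9. Let $g_\theta(\mathcal{M}(\mathcal{H})) := \{g_\theta(M) \mid M \in \mathcal{M}(\mathcal{H})\}$. Then $g_\theta(\mathcal{M}(\mathcal{H}))$ is a convex set. Similarly, $\tilde{g}_\theta(\mathcal{M}(\mathcal{H}))$ is also a convex set.

Proof. Let $M^{(1)}, M^{(2)} \in \mathcal{M}(\mathcal{H})$ and let $0 \le p \le 1$. Then we see

$$g_\theta\left(pM^{(1)} \oplus (1-p)M^{(2)}\right)_{ij} = \sum_x\frac{p^2\langle L_i, M_x^{(1)}\rangle_\theta\langle L_j, M_x^{(1)}\rangle_\theta}{p\,\langle I, M_x^{(1)}\rangle_\theta} + \sum_y\frac{(1-p)^2\langle L_i, M_y^{(2)}\rangle_\theta\langle L_j, M_y^{(2)}\rangle_\theta}{(1-p)\langle I, M_y^{(2)}\rangle_\theta}$$

$$= p\sum_x\frac{\langle L_i, M_x^{(1)}\rangle_\theta\langle L_j, M_x^{(1)}\rangle_\theta}{\langle I, M_x^{(1)}\rangle_\theta} + (1-p)\sum_y\frac{\langle L_i, M_y^{(2)}\rangle_\theta\langle L_j, M_y^{(2)}\rangle_\theta}{\langle I, M_y^{(2)}\rangle_\theta}$$

$$= p\, g_\theta(M^{(1)})_{ij} + (1-p)\, g_\theta(M^{(2)})_{ij}. \tag{24}$$

This implies that any convex combination of $g_\theta(M^{(1)})$ and $g_\theta(M^{(2)})$ belongs to $g_\theta(\mathcal{M}(\mathcal{H}))$. The same computation with $L^i$ in place of $L_i$ applies to $\tilde{g}_\theta$. □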
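Now we restrict ourselves to the case when $\dim\mathcal{H} = 2$. In this case it is necessary that $1 \le d \le 3$.

Lemma 10. Given $v = (v_1, \ldots, v_d)^t \in \mathbb{R}^d$ such that $|v| = 1$, then

$$\tilde{g}_\theta(M^{(v)}) = |v\rangle\langle v|, \tag{25}$$

where $M^{(v)}$ is a projection-valued measurement given by the spectral decomposition of $L^v := \sum_{i=1}^{d} v_i L^i$.

Proof.

$$\langle v|\tilde{g}_\theta(M^{(v)})|v\rangle = \sum_x\sum_{i,j} v_i v_j\frac{\langle L^i, M_x^{(v)}\rangle_\theta\langle L^j, M_x^{(v)}\rangle_\theta}{\langle I, M_x^{(v)}\rangle_\theta} = \sum_x\langle L^v, \bar{M}_x^{(v)}\rangle_\theta^2 \le \langle L^v, L^v\rangle_\theta = 1, \tag{26}$$

where $\bar{M}_x^{(v)} := M_x^{(v)}\big/\sqrt{\langle M_x^{(v)}, M_x^{(v)}\rangle_\theta}$. Because $\{\bar{M}_x^{(v)}\}_x$ is orthonormal with respect to $\langle\cdot, \cdot\rangle_\theta$, the inequality (26) follows from Bessel's inequality. Further, by definition, $L^v \in \operatorname{span}\{M_x^{(v)}\}_x$. Therefore

$$\langle v|\tilde{g}_\theta(M^{(v)})|v\rangle = 1. \tag{27}$$

According to Lemma 8,

$$\operatorname{Tr}\tilde{g}_\theta(M^{(v)}) \le \dim\mathcal{H} - 1 = 1. \tag{28}$$

We can conclude (25) from (27), (28), and $\tilde{g}_\theta(M^{(v)}) \ge 0$. □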

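Lemma 11. Let $M_+(d, \mathbb{R})$ be the set of $d \times d$ real positive semidefinite matrices. Then

$$\tilde{g}_\theta(\mathcal{M}(\mathcal{H})) = \{G \in M_+(d, \mathbb{R}) \mid \operatorname{Tr} G \le 1\}.$$

Proof. According to Lemma 10, for any $v = (v_1, \ldots, v_d)^t \in \mathbb{R}^d$ such that $|v| = 1$,

$$|v\rangle\langle v| \in \tilde{g}_\theta(\mathcal{M}(\mathcal{H})).$$

We further observe that $0 \in \tilde{g}_\theta(\mathcal{M}(\mathcal{H}))$ because the POVM $M := (I)$ provides no information. Then we see from Lemma 9 that

$$\tilde{g}_\theta(\mathcal{M}(\mathcal{H})) \supset \operatorname{co}\left(\{|v\rangle\langle v| \mid v \in \mathbb{R}^d,\ |v| = 1\} \cup \{0\}\right) = \{G \in M_+(d, \mathbb{R}) \mid \operatorname{Tr} G \le 1\}.$$

The converse inclusion follows from Lemma 8. □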
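Lemma 12.

$$g_\theta(\mathcal{M}(\mathcal{H})) = \left\{\sqrt{J_\theta}\, G\sqrt{J_\theta} \;\middle|\; G \in M_+(d, \mathbb{R}),\ \operatorname{Tr} G \le 1\right\}.$$

Proof. By the definitions of $L^i$ and $\tilde{g}_\theta$,

$$\tilde{g}_\theta(M) = U^{-1}\sqrt{J_\theta^{-1}}\, g_\theta(M)\sqrt{J_\theta^{-1}}\, U,$$

thus

$$\sqrt{J_\theta}\, U\tilde{g}_\theta(M)\, U^{-1}\sqrt{J_\theta} = g_\theta(M).$$

Therefore it follows from Lemma 11 that

$$g_\theta(\mathcal{M}(\mathcal{H})) = \left\{\sqrt{J_\theta}\, U G U^{-1}\sqrt{J_\theta} \;\middle|\; G \in M_+(d, \mathbb{R}),\ \operatorname{Tr} G \le 1\right\} = \left\{\sqrt{J_\theta}\, G\sqrt{J_\theta} \;\middle|\; G \in M_+(d, \mathbb{R}),\ \operatorname{Tr} G \le 1\right\}.$$

□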



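Lemma 13. Given $S \in M_+(d, \mathbb{R})$ such that $S > 0$,

$$\min\left\{\operatorname{Tr} S G^{-1} ;\ G \in M_+(d, \mathbb{R}),\ \operatorname{Tr} G = 1\right\} = \left(\operatorname{Tr}\sqrt{S}\right)^2.$$

Moreover, $\operatorname{Tr} S G^{-1} = \left(\operatorname{Tr}\sqrt{S}\right)^2$ only if $G = \sqrt{S}/\left(\operatorname{Tr}\sqrt{S}\right)$.

Proof. For $G = (G_{ij})_{1 \le i, j \le d}$, let $f(G) := \operatorname{Tr}\left(S G^{-1}\right) + \lambda\left(\operatorname{Tr} G - 1\right)$, where $\lambda$ is a Lagrange multiplier. Then

$$\frac{\partial f}{\partial G_{ij}} = \operatorname{Tr}\left(S\left(-G^{-1}\frac{\partial G}{\partial G_{ij}}G^{-1}\right)\right) + \lambda\delta_{ij} = -\langle e_j|G^{-1} S G^{-1}|e_i\rangle + \lambda\delta_{ij} = 0,$$

where $\{e_i\}_{1 \le i \le d}$ is the standard CONS of $\mathbb{R}^d$. Thus

$$G^{-1} S G^{-1} = \lambda I,$$

from which

$$G = \frac{\sqrt{S}}{\sqrt{\lambda}}$$

and

$$\lambda = \left(\operatorname{Tr}\sqrt{S}\right)^2$$

because of $\operatorname{Tr} G = 1$. As a consequence,

$$\min_G\operatorname{Tr}\left(S G^{-1}\right) = \operatorname{Tr}(\lambda G) = \lambda = \left(\operatorname{Tr}\sqrt{S}\right)^2.$$

□

Proof of Proposition 1. According to Lemma 12 and Lemma 13,

$$\min_{M \in \mathcal{M}(\mathcal{H})}\operatorname{Tr} H_\theta\, g_\theta(M)^{-1} = \min\left\{\operatorname{Tr} H_\theta\sqrt{J_\theta^{-1}}\, G^{-1}\sqrt{J_\theta^{-1}} \;\middle|\; G \in M_+(d, \mathbb{R}),\ \operatorname{Tr} G = 1\right\}$$

$$= \min\left\{\operatorname{Tr}\sqrt{J_\theta^{-1}}\, H_\theta\sqrt{J_\theta^{-1}}\, G^{-1} \;\middle|\; G \in M_+(d, \mathbb{R}),\ \operatorname{Tr} G = 1\right\} = \left(\operatorname{Tr} R_\theta\right)^2.$$

When $\operatorname{Tr} H_\theta\, g_\theta(M)^{-1}$ achieves the minimum,

$$G = \frac{R_\theta}{\operatorname{Tr} R_\theta},$$

thus

$$g_\theta(M) = \sqrt{J_\theta}\, G\sqrt{J_\theta} = \sqrt{J_\theta}\,\frac{R_\theta}{\operatorname{Tr} R_\theta}\,\sqrt{J_\theta}.$$

□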

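Proof of Proposition 2. Assume that $d = 3$. According to (11), (24), and Lemma 10,

$$g_\theta(M) = \sqrt{J_\theta}\, U\tilde{g}_\theta(M)\, U^{-1}\sqrt{J_\theta} = \sqrt{J_\theta}\, U\left(p_1\tilde{g}_\theta(M^{(1)}) + p_2\tilde{g}_\theta(M^{(2)}) + p_3\tilde{g}_\theta(M^{(3)})\right) U^{-1}\sqrt{J_\theta}$$

$$= \sqrt{J_\theta}\, U\left(p_1\operatorname{diag}(1, 0, 0) + p_2\operatorname{diag}(0, 1, 0) + p_3\operatorname{diag}(0, 0, 1)\right) U^{-1}\sqrt{J_\theta}$$

$$= \sqrt{J_\theta}\, U\frac{S}{\operatorname{Tr} S}U^{-1}\sqrt{J_\theta} = \sqrt{J_\theta}\,\frac{R_\theta}{\operatorname{Tr} R_\theta}\,\sqrt{J_\theta}.$$

When $d = 1$ or $2$, we can prove this in a similar way. □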

B Rotationally symmetric weight

In this appendix, we show that any rotationally symmetric weight is represented in the form

$$H_x := f(r)\, I + \left(g(r) - f(r)\right)\frac{1}{r^2}|x\rangle\langle x| \tag{29}$$

for $x \ne 0$, where $r = |x|$ and $f, g$ are functions on $(0, 1)$ such that $f(r) > 0$ and $g(r) > 0$.

Given $x \in \mathcal{X}$ $(x \ne 0)$ arbitrarily, let $e_1, e_2, e_3$ be an orthonormal basis of $\mathbb{R}^3$ with $e_3 = \frac{x}{|x|}$, and let $V \in SO(3)$ be any rotation about the $e_3$-axis. Since

$$V^* H_x V = V^* H_{(Vx)} V = H_x, \tag{30}$$

$H_x$ and $V$ are simultaneously diagonalized, and $e_3$ is one of their common eigenvectors. The other eigenvalues of $H_x$ must be degenerate because $V$ is any rotation about the $e_3$-axis. Then $H_x$ should be represented as

$$H_x = f(x)|e_1\rangle\langle e_1| + f(x)|e_2\rangle\langle e_2| + g(x)|e_3\rangle\langle e_3| = f(x)\, I + \left(g(x) - f(x)\right)\frac{1}{r^2}|x\rangle\langle x|. \tag{31}$$

Let $U \in SO(3)$ be any rotation. It follows that

$$U^* H_{(Ux)} U = U^*\left(f(Ux)\, I + \left(g(Ux) - f(Ux)\right)\frac{1}{r^2}|Ux\rangle\langle Ux|\right) U$$

$$= f(Ux)\, I + \left(g(Ux) - f(Ux)\right)\frac{1}{r^2}|U^* U x\rangle\langle U^* U x|$$

$$= f(Ux)\, I + \left(g(Ux) - f(Ux)\right)\frac{1}{r^2}|x\rangle\langle x|. \tag{32}$$

By comparing (31) and (32), we see that $f(x) = f(Ux)$ and $g(x) = g(Ux)$ for any $U \in SO(3)$. Therefore $f$ and $g$ must be represented by $f(x) = f(|x|)$ and $g(x) = g(|x|)$.
C Bures distance and quantum Fisher information matrix

The Bures distance between two states $\rho$ and $\sigma$ is defined by

$$B(\rho, \sigma) := 4\left(1 - \operatorname{Tr}\sqrt{\sqrt{\rho}\,\sigma\sqrt{\rho}}\right).$$

It is known that

$$B(\tau_x, \tau_{x + dx}) = \frac{1}{2}\sum_{i,j} J_{x,ij}\, dx^i dx^j + O(|dx|^3) \tag{33}$$

when $|dx|$ is sufficiently small. Given an estimator $(M, \hat{x})$ that is locally unbiased at $x_0 \in \mathcal{X}$, it follows from (33) that

$$\operatorname{Tr} J_{x_0} V_{x_0}[M, \hat{x}] = E_{x_0}\Big[M, \sum_{i,j} J_{x_0,ij}\left(\hat{x}^i - x_0^i\right)\left(\hat{x}^j - x_0^j\right)\Big] = E_{x_0}\left[M,\ 2B(\tau_{x_0}, \tau_{\hat{x}}) + O(|\hat{x} - x_0|^3)\right].$$
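The second-order relation between the Bures distance and the SLD Fisher information can be checked numerically for a qubit. The following is a minimal sketch, assuming NumPy, the normalization $B = 4(1 - \operatorname{Tr}\sqrt{\sqrt{\rho}\,\sigma\sqrt{\rho}})$, and the closed form $J_x = (I - |x\rangle\langle x|)^{-1}$ of Lemma 5; the helper names and the test points are illustrative.

```python
import numpy as np

PAULI = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)

def qubit_state(x):
    """Density operator (1) for Stokes parameters x."""
    return 0.5 * (np.eye(2) + np.einsum("m,mij->ij", x, PAULI))

def sqrtm_hermitian(a):
    """Square root of a Hermitian positive semidefinite matrix."""
    w, v = np.linalg.eigh(a)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T

def bures(rho, sigma):
    """B(rho, sigma) = 4 (1 - Tr sqrt(sqrt(rho) sigma sqrt(rho)))."""
    s = sqrtm_hermitian(rho)
    fidelity = np.trace(sqrtm_hermitian(s @ sigma @ s)).real
    return 4.0 * (1.0 - fidelity)

x = np.array([0.3, -0.4, 0.5])               # arbitrary interior point
dx = 1e-3 * np.array([1.0, 2.0, -1.0])       # small displacement
J = np.linalg.inv(np.eye(3) - np.outer(x, x))

lhs = bures(qubit_state(x), qubit_state(x + dx))
rhs = 0.5 * dx @ J @ dx                      # leading term on the right-hand side of the expansion
```

The two values agree up to the third-order remainder, which shrinks relative to the leading term as `dx` is made smaller.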